The Higgs boson in the standard model (SM) breaks the electroweak
$SU(2)_L \otimes U(1)_Y$ symmetry. While this symmetry is now well
established, the mechanism of the symmetry breaking has not yet been
identified. Direct searches at the LEP experiments have set a lower limit
on the Higgs boson mass of 114.4 GeV/$c^2$ at 95% C.L. in the context of
the SM [1]. Precision measurements provide the indirect upper limit
$m_H < 144$ GeV/$c^2$ at 95% C.L. through radiative corrections to the SM
predictions of the particle masses and couplings [2]. However, these
indirect limits assume no significant contributions to the radiative
corrections due to as-yet unobserved processes.

In this Letter, we report a search for the process $gg \to H \to WW^{(*)}$
in a 3.0 fb$^{-1}$ integrated luminosity sample of $p\bar{p}$ collisions at
$\sqrt{s} = 1.96$ TeV produced by the Fermilab Tevatron and collected by
the CDF II detector. For an SM Higgs boson with a mass that is not directly
excluded by the LEP experiments, the dominant production mechanism at the
Tevatron is gluon-gluon fusion, which proceeds via a virtual top quark
loop. For $m_H > 135$ GeV/$c^2$, the SM Higgs boson decays primarily to
$WW^{*}$ [3], where one of the final state $W$ bosons is virtual for $m_H$
below twice the $W$ mass.

The events are reconstructed in the $\ell\ell\nu\nu$ final state, whose
branching fraction is 6.0% of the $WW^{*}$ decays, where $\ell$ is either
an electron $e$ or a muon $\mu$, including those from $\tau$ leptons
produced in the $W$ decays. The SM Higgs boson branching fraction to
$WW^{*}$ varies from 7.5% at 115 GeV/$c^2$ to 73.5% at 200 GeV/$c^2$, with
a maximum of 96.5% at $\approx 170$ GeV/$c^2$. Previous searches set limits
ranging from 10 to 40 times the predicted SM rate, depending on the value
of $m_H$.

The CDF II detector is a multipurpose solenoidal spectrometer surrounded by
calorimeters and muon detectors. The geometry is described using the
azimuthal angle $\phi$ and the pseudorapidity
$\eta = -\ln[\tan(\theta/2)]$, where $\theta$ is the polar angle with
respect to the proton beam axis (positive $z$-axis). The transverse energy
$E_T$ is $E \sin\theta$, where $E$ is the energy associated with a
calorimeter element or energy cluster. Similarly, $p_T$ is the track
momentum component transverse to the beam line.
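
The coordinate definitions above translate directly into a few lines of
code. The following is a minimal Python sketch; the function names are
hypothetical and chosen only for this illustration.

```python
import math

def pseudorapidity(theta):
    """eta = -ln[tan(theta/2)], with theta the polar angle in radians."""
    return -math.log(math.tan(theta / 2.0))

def polar_angle(eta):
    """Inverse of pseudorapidity(): theta = 2 * atan(exp(-eta))."""
    return 2.0 * math.atan(math.exp(-eta))

def transverse_energy(energy, theta):
    """E_T = E sin(theta) for a calorimeter element or cluster."""
    return energy * math.sin(theta)

def transverse_momentum(px, py):
    """p_T: momentum component transverse to the beam (z) axis."""
    return math.hypot(px, py)

# Polar angle at the edge of the central region, |eta| = 1.1, in degrees.
print(math.degrees(polar_angle(1.1)))
```

A deposit perpendicular to the beam ($\theta = \pi/2$) has $\eta = 0$ and
$E_T = E$.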

The events we consider must pass one of four online selections (triggers)
before being recorded. One electron trigger requires an electromagnetic
(EM) energy cluster in the central ($|\eta| < 1.1$) calorimeter with
$E_T > 18$ GeV matched to a track found in the drift chamber with
$p_T > 8$ GeV/$c$. A second electron trigger requires an EM energy cluster
with $E_T > 20$ GeV in the forward ($1.2 < |\eta| < 2.0$) calorimeter and
missing transverse energy $\not{E}_T > 15$ GeV. The variable $\not{E}_T$,
used to infer the presence of neutrinos, is defined through
$\vec{\not{E}}_T = -\sum_i E_{T,i}\,\hat{n}_{T,i}$ and
$\not{E}_T = |\vec{\not{E}}_T|$, where $\hat{n}_{T,i}$ is the transverse
component of the unit vector pointing from the interaction point to
calorimeter element $i$. Muon triggers are based on track segments in the
muon chambers matched to a drift-chamber track with $p_T > 18$ GeV/$c$.
Trigger efficiencies are measured using leptonic $W$ and $Z$ data samples
[61].
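
As an illustration of the missing transverse energy definition, the sketch
below computes it from a list of calorimeter elements. It assumes the usual
convention that the missing transverse energy vector is the negative vector
sum of the element transverse energies; the `(E_T, phi)` tuple layout and
the function name are hypothetical.

```python
import math

def missing_et(elements):
    """Return (magnitude, azimuth) of the missing transverse energy.

    elements: iterable of (et, phi) pairs, one per calorimeter element,
    where phi is the azimuth of the element as seen from the interaction
    point (hypothetical input layout for this sketch).
    """
    # Negative vector sum of the transverse energy deposits.
    mex = -sum(et * math.cos(phi) for et, phi in elements)
    mey = -sum(et * math.sin(phi) for et, phi in elements)
    return math.hypot(mex, mey), math.atan2(mey, mex)

# Two back-to-back deposits of 30 and 10 GeV leave a 20 GeV imbalance
# pointing opposite to the larger deposit.
met, met_phi = missing_et([(30.0, 0.0), (10.0, math.pi)])
print(met, met_phi)
```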

To improve the signal acceptance while maintaining acceptable background
rejection for the $W$+jets and $W\gamma$ processes, where a jet or $\gamma$
is misidentified as a lepton, we use a modified version of the lepton
identification strategy developed for the $WZ$ observation analysis.
Candidate leptons are separated into six mutually exclusive categories: two
for electrons, three for muons, and one for tracks that extrapolate outward
to detector regions with insufficient calorimeter coverage for energy
measurement. The electron categories are central ($|\eta| < 1.1$), using a
drift-chamber-based tracking algorithm, and forward
($1.2 < |\eta| < 2.0$), using a silicon-detector-based tracking algorithm.
One of the muon categories uses the muon chambers and the other two use
tracks matched with energy deposits consistent with minimum ionization in
the central or forward calorimeters.

All lepton candidates are required to be isolated such that the sum of the
$E_T$ for the calorimeter elements in a cone of
$\Delta R = \sqrt{(\Delta\eta)^2 + (\Delta\phi)^2} < 0.4$ around the lepton
is less than 10% of the $E_T$ (for electrons) or $p_T$ (for muons and track
lepton candidates). For lepton types in the central region, where the track
finding efficiency is sufficient, we also apply a track-based isolation
criterion, which requires that no more than 10% of the electron $E_T$ or
muon $p_T$ be found in other tracks within a cone of $\Delta R < 0.4$
around the lepton track.
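
The calorimeter isolation requirement can be sketched as follows. This is
an illustration only: the dictionary layout is hypothetical, and the sketch
assumes that the lepton's own energy deposit has already been removed from
the list of calorimeter elements.

```python
import math

def delta_phi(phi1, phi2):
    """Azimuthal separation wrapped into [-pi, pi]."""
    return math.remainder(phi1 - phi2, 2.0 * math.pi)

def delta_r(eta1, phi1, eta2, phi2):
    """Delta R = sqrt((Delta eta)^2 + (Delta phi)^2)."""
    return math.hypot(eta1 - eta2, delta_phi(phi1, phi2))

def is_calo_isolated(lepton, elements, cone=0.4, max_fraction=0.10):
    """True if the E_T summed in the cone is below 10% of the lepton scale.

    lepton:   dict with 'eta', 'phi' and 'scale' (E_T for electrons,
              p_T for muons and track leptons).
    elements: list of dicts with 'et', 'eta', 'phi'; assumed NOT to
              include the lepton's own deposit (assumption of this sketch).
    """
    cone_et = sum(
        e["et"]
        for e in elements
        if delta_r(lepton["eta"], lepton["phi"], e["eta"], e["phi"]) < cone
    )
    return cone_et < max_fraction * lepton["scale"]

lepton = {"eta": 0.0, "phi": 3.1, "scale": 30.0}
elements = [
    {"et": 2.0, "eta": 0.1, "phi": -3.1},  # inside the cone once phi wraps
    {"et": 5.0, "eta": 1.0, "phi": 3.1},   # outside the cone
]
print(is_calo_isolated(lepton, elements))
```

The wrap-around in `delta_phi` matters for leptons near $\phi = \pm\pi$,
as in the first element of the example.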

The Higgs boson candidates are selected from events with exactly two lepton
candidates. At least one lepton is required to match a trigger lepton
candidate and have $E_T > 20$ GeV ($p_T > 20$ GeV/$c$) for electrons
(muons). We loosen this requirement to 10 GeV (GeV/$c$) for the other
lepton to increase the kinematic acceptance, particularly for lower $m_H$,
where one $W$ boson is off-shell, with a mass below the $W$ boson mass, and
produces a lower-$p_T$ lepton. We also require a dilepton invariant mass
$m_{\ell\ell} > 16$ GeV/$c^2$ to suppress misidentified multijet events.
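
The dilepton requirements can be summarized in a short sketch. The lepton
record layout is hypothetical, `pt` stands for $E_T$ (electrons) or $p_T$
(muons), and the invariant mass treats both leptons as massless, which is
an approximation made for this illustration.

```python
import math

def dilepton_mass(l1, l2):
    """Invariant mass of two leptons, neglecting the lepton masses."""
    m2 = 2.0 * l1["pt"] * l2["pt"] * (
        math.cosh(l1["eta"] - l2["eta"]) - math.cos(l1["phi"] - l2["phi"])
    )
    return math.sqrt(max(m2, 0.0))

def passes_dilepton_selection(leptons):
    """Exactly two leptons, thresholds of 20 and 10 GeV, m_ll > 16 GeV/c^2."""
    if len(leptons) != 2:
        return False
    a, b = leptons

    def thresholds_ok(lead, other):
        # The leading lepton must also match a trigger lepton candidate.
        return lead["trigger_matched"] and lead["pt"] > 20.0 and other["pt"] > 10.0

    if not (thresholds_ok(a, b) or thresholds_ok(b, a)):
        return False
    return dilepton_mass(a, b) > 16.0

event = [
    {"pt": 25.0, "eta": 0.0, "phi": 0.0, "trigger_matched": True},
    {"pt": 12.0, "eta": 0.0, "phi": math.pi, "trigger_matched": False},
]
print(dilepton_mass(*event), passes_dilepton_selection(event))
```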

Aside from $H \to WW^{(*)}$ production, other SM processes that can lead to
two high-$p_T$ leptons include Drell-Yan (DY), $t\bar{t}$, $WW$, $WZ$, and
$ZZ$ production, and $W$ production in association with a photon
($W\gamma$) or a jet ($W$+jets) misidentified as a lepton. The $t\bar{t}$
contribution is suppressed by requiring fewer than two reconstructed jets
with $E_T > 15$ GeV and $|\eta| < 2.5$ in the event. This allows for up to
one jet due to initial state radiation. The DY background is suppressed by
requiring sufficiently large $\not{E}_T$ in the event to remove
contributions from mismeasured leptons or jets. This is achieved by requiring
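$\not{E}_{T,\mathrm{rel}} > 25$ GeV, where

$$
\not{E}_{T,\mathrm{rel}} =
\begin{cases}
\not{E}_T & \text{if } \Delta\phi(\not{E}_T, \ell\text{, jet}) > \frac{\pi}{2}, \\
\not{E}_T \sin\Delta\phi(\not{E}_T, \ell\text{, jet}) & \text{if } \Delta\phi(\not{E}_T, \ell\text{, jet}) < \frac{\pi}{2},
\end{cases}
$$

and $\Delta\phi(\not{E}_T, \ell\text{, jet})$ is the angle between the
$\not{E}_T$ direction and the nearest lepton or jet. The observed
$\not{E}_T$ is corrected for muons and track-only lepton candidates,
because they do not deposit all of their energy in the calorimeter.

TABLE I: Expected Higgs boson yield as a function of $m_H$.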
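
The relative missing transverse energy can be computed as in the sketch
below, which takes the missing transverse energy unchanged when the nearest
lepton or jet is more than $\pi/2$ away in azimuth and projects it with
$\sin\Delta\phi$ otherwise. The function name and argument layout are
hypothetical, and the sketch assumes at least one lepton or jet is present,
as is always the case after the dilepton selection.

```python
import math

def met_rel(met, met_phi, object_phis):
    """Relative missing E_T given the azimuths of all leptons and jets."""
    # Azimuthal distance to the nearest lepton or jet, in [0, pi].
    dphi_min = min(
        abs(math.remainder(met_phi - phi, 2.0 * math.pi)) for phi in object_phis
    )
    if dphi_min > math.pi / 2.0:
        return met
    return met * math.sin(dphi_min)

# Missing E_T of 40 GeV with the nearest lepton 30 degrees away is reduced
# to its component perpendicular to that lepton.
print(met_rel(40.0, 0.0, [math.pi / 6.0, math.pi]))
```

Projecting perpendicular to the nearest object reduces the variable for
events where a mismeasured lepton or jet creates missing transverse energy
along its own direction.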

The acceptances for the $H \to WW^{(*)}$, $WW$, $WZ$, $ZZ$, $W\gamma$, DY,
and $t\bar{t}$ processes are determined using simulated data. Events are
simulated with the MC@NLO program for $WW$, PYTHIA for $H \to WW^{(*)}$,
DY, $WZ$, $ZZ$, and $t\bar{t}$, and the generator described in Ref. [13]
for $W\gamma$. The response of the CDF II detector is then estimated with a
GEANT-4-based simulation [11], to which an efficiency correction of up to
10% per lepton is applied based on measurements of the lepton
reconstruction and identification efficiencies using observed
$Z \to \ell^+\ell^-$ events. An additional correction is applied to the
$W\gamma$ background estimate based on a measurement in data of the
efficiency of the requirement that suppresses photon conversions identified
as electrons. The $W$+jets contribution is estimated from data by
extrapolating from a sample of events that contain an identified lepton and
a jet $j_l$ containing a track or EM energy cluster similar to those
required in the lepton identification. The contribution of each event to
the total yield is scaled by the probability that the $j_l$ is identified
as a lepton. This probability $p(p_T; j_l)$ is determined for each lepton
type as a function of the $j_l$ $p_T$ using multijet events collected with
jet-based triggers. A correction to $p(p_T; j_l)$ is applied for the small
real lepton contribution using Monte Carlo simulation of single $W$ and $Z$
boson production.
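
The data-driven $W$+jets estimate amounts to a weighted sum over the
lepton-plus-jet sample, with each event weighted by the probability that
its jet is identified as a lepton. The sketch below illustrates this with a
binned lookup. The bin edges and probabilities are invented placeholders,
not values from this analysis, and the real-lepton correction described
above is omitted.

```python
from bisect import bisect_right

def fake_probability(jet_pt, bin_lower_edges, rates):
    """Binned probability that a jet of the given p_T fakes a lepton."""
    i = bisect_right(bin_lower_edges, jet_pt) - 1
    i = min(max(i, 0), len(rates) - 1)  # clamp under- and overflow
    return rates[i]

def wjets_yield(events, bin_lower_edges, rates):
    """Sum of per-event weights over the lepton + jet control sample."""
    return sum(
        fake_probability(e["jet_pt"], bin_lower_edges, rates) for e in events
    )

# Placeholder numbers for illustration only (one table per lepton type
# would be needed in practice).
edges = [10.0, 20.0, 40.0]      # lower bin edges in GeV/c
rates = [0.02, 0.01, 0.005]     # hypothetical probabilities per bin
sample = [{"jet_pt": 15.0}, {"jet_pt": 25.0}, {"jet_pt": 50.0}]
print(wjets_yield(sample, edges, rates))
```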

Based on the procedure above, we expect $768 \pm 91$ background events and
observe 779 in the selected region. The expected composition of the
background is $356 \pm 49$ $WW$, $24.9 \pm 3.9$ $WZ$, $21.8 \pm 3.5$ $ZZ$,
$25.5 \pm 5.0$ $t\bar{t}$, $138 \pm 31$ DY, $90.5 \pm 24.1$ $W\gamma$, and
$111 \pm 27$ $W$+jets, where the indicated uncertainties include the
systematic uncertainties described below. As a cross-check of the
background model, we measure the fraction of $q\bar{q} \to WW$ events in
the sample with a method similar to the signal extraction described below
and find it to be consistent with the expectation. The largest predicted SM
Higgs boson yield is 11.6 events at $m_H = 160$ GeV/$c^2$, which results
from an acceptance of 16%. Table I shows the dependence of the expected
yield on $m_H$. Since the level of non-$WW^{(*)}$ background depends on the
lepton identification categories, this information is used to divide the
sample into high and low signal-to-background (S/B) classes.
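
The quoted background composition can be checked with simple arithmetic.
The sketch below sums the central values and compares two limiting ways of
combining the per-process uncertainties. How the uncertainties were
actually combined is not spelled out here, so the comparison is only
indicative.

```python
import math

# (expected events, uncertainty) per background process, as quoted above.
background = {
    "WW": (356.0, 49.0),
    "WZ": (24.9, 3.9),
    "ZZ": (21.8, 3.5),
    "ttbar": (25.5, 5.0),
    "DY": (138.0, 31.0),
    "Wgamma": (90.5, 24.1),
    "W+jets": (111.0, 27.0),
}

total = sum(n for n, _ in background.values())
# Fully uncorrelated uncertainties add in quadrature ...
quadrature = math.sqrt(sum(u * u for _, u in background.values()))
# ... while fully correlated uncertainties add linearly.
linear = sum(u for _, u in background.values())

print(round(total, 1), round(quadrature, 1), round(linear, 1))
```

The central values sum to 767.7, matching the quoted total of 768. The
quoted uncertainty of 91 lies between the quadrature and linear sums, as
one would expect if some systematic uncertainties are shared between
processes.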

After selection, the dominant background is the $q\bar{q} \to WW$ process,
which differs from the signal process only by spin, production mechanism,
and resonant structure. Because of the two neutrinos in the final state, a
simple mass peak cannot be used to isolate the resonant
$gg \to H \to WW^{(*)}$ process from the backgrounds. Instead, we combine
two different multivariate techniques to differentiate signal and
background. One is a matrix element (ME) technique, which uses an
event-by-event calculation of the probability density for each contributing
process to produce the observed event. If all details of the collision
properties and the detector response are modeled in the ME calculation,
this method provides the optimal sensitivity to the signal. However,
several approximations are used in the calculations: theoretical
differential cross sections are only implemented to leading order, a simple
parameterization of the detector response is used, and for some small ($WZ$
and $t\bar{t}$) or difficult to model (DY) backgrounds, we do not calculate
a probability density. To improve on these approximations, we extend the ME
calculation with a neural network (NN), which exploits the more complete
model implemented in the simulated data and the $W$+jets model.

The event probability density for the ME method is

$$
P(x_{obs}) = \frac{1}{\langle\sigma\rangle} \int \frac{d\sigma_{LO}(y)}{dy}\, \epsilon(y)\, G(x_{obs}, y)\, dy,
$$

where the elements of $y$ ($x_{obs}$) are the true (observed) values of the
lepton momenta and $\not{E}_T$, $d\sigma_{LO}/dy$ is the parton level
differential cross section [12], $\epsilon(y)$ is a parameterization of the
detector acceptance and efficiency, and $G(x_{obs}, y)$ is the transfer
function representing the detector resolution and a PYTHIA-based estimate
of the transverse momentum of the system due to initial state radiation.
The constant $\langle\sigma\rangle$ normalizes the total event probability
to unity. This calculation integrates the theoretical differential cross
section over the missing information due to the two unobserved neutrinos in
the final state. We form a likelihood ratio discriminant, which is the
signal probability divided by the sum of signal and background
probabilities,

$$
LR_{H \to WW^{(*)}}(x_{obs}) = \frac{P_H(x_{obs})}{P_H(x_{obs}) + \sum_i k_i P_i(x_{obs})},
$$

where $k_i$ are the expected background fractions of $WW$, $ZZ$, $W\gamma$,
and $W$+jets. The LR distributions are shown in Figure 1(a). Additional ME
likelihood ratios $LR_{WW}$, $LR_{ZZ}$, $LR_{W\gamma}$, and
$LR_{W+\mathrm{jets}}$ are defined analogously to $LR_{H \to WW^{(*)}}$.
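
Once the per-process probability densities have been evaluated for an
event, the likelihood ratio is a one-line computation. The sketch below
assumes the densities are already available as plain numbers. The values
and background fractions in the example are invented for illustration, and
returning 0 when all densities vanish is a choice made for this sketch.

```python
def likelihood_ratio(p_signal, p_backgrounds, fractions):
    """LR = P_H / (P_H + sum_i k_i P_i) for one event.

    p_signal:      signal probability density P_H(x_obs).
    p_backgrounds: dict of background densities P_i(x_obs) by process.
    fractions:     dict of expected background fractions k_i by process.
    """
    if set(p_backgrounds) != set(fractions):
        raise ValueError("densities and fractions must cover the same processes")
    background = sum(fractions[name] * p_backgrounds[name] for name in fractions)
    denominator = p_signal + background
    # Convention of this sketch: an event with no support anywhere gets LR = 0.
    return p_signal / denominator if denominator > 0.0 else 0.0

# Hypothetical densities and fractions, for illustration only.
densities = {"WW": 1.0, "ZZ": 0.5, "Wgamma": 0.2, "W+jets": 0.1}
k = {"WW": 0.6, "ZZ": 0.1, "Wgamma": 0.15, "W+jets": 0.15}
print(likelihood_ratio(2.0, densities, k))
```

By construction the ratio lies between 0 and 1, approaching 1 for
signal-like events and 0 for background-like events.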

For the final results, an NN discriminant is used to extend the ME
calculation, using as input the ME likelihood ratios in addition to various
kinematic variables. For each of the Higgs boson masses investigated, an NN
is trained on signal events and an appropriately weighted composition of
background events. The NN classifies events as signal or background based
on the inputs $LR_{H \to WW^{(*)}}$, $LR_{WW}$, $LR_{ZZ}$, $LR_{W\gamma}$,
$LR_{W+\mathrm{jets}}$, $\Delta\phi_{\ell\ell}$, $\Delta R_{\ell\ell}$,
$m_{\ell\ell}$, $\Delta\phi(\not{E}_T, \ell\text{, jet})$, and
$\not{E}_{T,\mathrm{rel}}$, where $\Delta\phi_{\ell\ell}$ and
$\Delta R_{\ell\ell}$ are the separation between the two leptons in $\phi$
and $\Delta R$, respectively. We find that the most discriminating input
variables are $LR_{H \to WW^{(*)}}$, $\Delta R_{\ell\ell}$, and
$\not{E}_{T,\mathrm{rel}}$. An example of the NN output is shown in Figure
1(b). The presented results use the NeuroBayes [13] program. Comparable
results are also obtained using the TMVA-MLP program, demonstrating the
robustness of the technique.

FIG. 1: The likelihood ratio distribution from (a) the ME discriminant
alone and (b) the full NN score for $m_H = 160$ GeV/$c^2$. The Higgs boson
distribution is normalized to the SM expectation. The distributions are
shown only for the high S/B class, which provides the majority of the
sensitivity.

The systematic uncertainties associated with the Monte Carlo simulation
affect all of the signal and background expectations similarly, except for
$W$+jets. The uncertainties from the lepton selection and trigger
efficiency measurements are propagated through the analysis, giving
uncertainties from 1.4% to 2.0% and 2.1% to 7.1% for the respective
efficiencies of the different signal mass points and background processes.
The detector acceptance variation due to parton-distribution function (PDF)
uncertainties is assessed to be 1.9% to 4.1% using the 20 pairs of PDF sets
described in Ref. [31]. We assign a 6% luminosity uncertainty to the signal
and background estimates that are obtained from simulation [16]. The cross
section uncertainties are 10% for $WW$ [13], $WZ$, $ZZ$, and $W\gamma$
[13], and 15% for $t\bar{t}$. Based on a comparison of simulated $WW$
events generated with MC@NLO and PYTHIA, we assign a systematic uncertainty
on the acceptance due to higher order QCD effects of 5.5% for $WW$ events
and 10% for the other modes, which are only simulated at leading order.

TABLE II: Expected and observed upper limits on
$\sigma(gg \to H) \times B(H \to WW^{(*)})$ and
$\sigma(gg \to H) \times B(H \to \ldots)$

The systematic uncertainty on the $W$+jets background is estimated to be
24% from differences in the observed probability that a jet is identified
as a lepton for jets collected using different jet $E_T$ trigger
thresholds. These variations correspond to changing the parton composition
of the jets and the relative amount of contamination from real leptons.
Because only the $\not{E}_{T,\mathrm{rel}}$ requirement suppresses the DY
background, there is an uncertainty due to the $\not{E}_T$ resolution
modeling, which is estimated to be 20% based on comparisons of the data and
Monte Carlo simulation in a sample of dilepton events. For the $W\gamma$
background contribution, there is an additional uncertainty of 20% from the
detector material description and photon-conversion veto efficiency.

A Bayesian credibility level (C.L.) is calculated for each $m_H$ hypothesis
based on the combined binned likelihood of the discriminant distributions
for the high and low S/B samples. A posterior density is obtained by
multiplying this likelihood by Gaussian prior densities for the background
normalizations and systematic uncertainties, leaving
$\sigma(gg \to H) \times B(H \to WW^{(*)})$ with a uniform prior density. A
95% C.L. limit is then determined such that 95% of the posterior density
for $\sigma(gg \to H) \times B(H \to WW^{(*)})$ falls below the limit.
Limits as a fraction of the SM cross section $\sigma_{SM}(gg \to H)$ are
calculated by including $\sigma_{SM}(gg \to H)$ as a parameter whose prior
density is determined by the 10% theoretical uncertainty of its
next-to-next-to-leading-log prediction. The resulting cross section limits
for both discriminants are shown in Table II and Figure 2.
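
The logic of the Bayesian limit can be illustrated with a much simpler toy:
a single counting bin with a uniform prior on the signal yield and one
Gaussian-constrained background normalization, marginalized numerically on
a grid. This is a sketch under those stated simplifications, not the binned
two-sample calculation used for the results; the function name, grid sizes,
and inputs are all choices made for this example.

```python
import numpy as np

def bayesian_upper_limit(n_obs, b_mean, b_sigma, s_max,
                         cl=0.95, n_s=2001, n_b=401):
    """Upper limit on the signal yield s in a one-bin counting experiment.

    Likelihood: Poisson(n_obs | s + b). Priors: uniform in s on [0, s_max],
    Gaussian(b_mean, b_sigma) in b, truncated to positive values.
    """
    s = np.linspace(0.0, s_max, n_s)
    b = np.linspace(max(b_mean - 5.0 * b_sigma, 1e-9),
                    b_mean + 5.0 * b_sigma, n_b)
    mu = s[:, None] + b[None, :]
    # Poisson log-likelihood up to a constant that cancels in the ratio.
    log_like = n_obs * np.log(mu) - mu
    log_prior_b = -0.5 * ((b - b_mean) / b_sigma) ** 2
    log_post = log_like + log_prior_b[None, :]
    # Marginalize over b on the uniform grid, then build the CDF in s.
    post_s = np.exp(log_post - log_post.max()).sum(axis=1)
    cdf = np.cumsum(post_s)
    cdf /= cdf[-1]
    return float(s[np.searchsorted(cdf, cl)])

# With zero observed events the posterior is exp(-s), so the 95% limit
# should be close to -ln(0.05), independent of the background prior.
print(bayesian_upper_limit(0, 1.0, 0.2, s_max=20.0))
```

The upper bound `s_max` must be chosen large enough that the posterior is
negligible there; otherwise the limit is biased low.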

In conclusion, we have presented limits on the production of a Higgs boson
through gluon fusion followed by its decay to a pair of $W$ bosons. A
combination of matrix element and neural network techniques is used to
discriminate signal from background. Studies using the two techniques
independently achieve consistent results, with a sensitivity approximately
$0.1\,\sigma_{SM}$ worse than the combination at $m_H = 160$ GeV/$c^2$. The
consistency of results obtained with different algorithms provides evidence
of the robustness of the multivariate techniques. At the most sensitive
value of $m_H = 160$ GeV/$c^2$, the observed limit is 1.7 times the SM
prediction, where the median expected limit is 2.2, corresponding to a
downward fluctuation slightly larger than one standard deviation. Compared
to an optimized selection and a likelihood based on the
$\Delta\phi_{\ell\ell}$ variable, the multivariate discriminators gain a
factor of 1.7 to 2.5 in effective integrated luminosity, depending on
$m_H$. This measurement also constrains alternative models in which the
$gg \to H$ coupling is enhanced by additional particles in the virtual
loops of the production amplitude [20].

FIG. 2: Upper limits on
$\sigma(gg \to H) \times B(H \to WW^{(*)}) / \sigma_{SM}(gg \to H) \times B_{SM}(H \to WW^{(*)})$
as a function of $m_H$.